The table shows the average word length (counted without multiplicity) for the most frequent 10n (n=1,2,…) words.
It is clear that the average word length increases for larger frequency ordered word lists, but the picture below shows a nearly linear increase. Here we used logarithmic scaling for the x-axis.
The table above allows a much finer comparison of the average word length (without multiplicity) for different languages.
Average word length for the top-1000 words:
SELECT 1000, avg(char_length(word)) FROM words WHERE w_id>100 and 1100>=w_id;
Explain the nearly linear growth in the above picture!
3.5.1.1 Words by Length without multiplicity
3.5.1.2 Words by Length with multiplicity